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Question 6 


Intent of Question 





The primary goals of this investigative task were to assess a student’s ability to (1) define a parameter and 
state a correct pair of hypotheses; (2) explain how a particular statistic measures skewness; (3) use the 
observed value of the statistic and a simulated sampling distribution to make a conclusion about the shape 
of the population; and (4) create a new statistic and explain how it measures skewness. 


Solution 
Part (a): 


7 


The parameter of interest is “ = population mean miles per gallon (mpg) of a particular car model. 


7 





[he null and alternative hypotheses are as follows: 


Ho: w=27 
Hz “<27 
Part (b): 


If the distribution is right-skewed, one would expect the mean to be greater than the median. 


sample mean 


Therefore the ratio should be large (greater than 1). 





sample median 


Part (c): 


Because we are testing for right-skewness, the estimated p-value will be the proportion of the 
simulated statistics that are greater than or equal to the observed value of 1.03. The dotplot shows that 
14 of the 100 values are more than 1.03. Because this simulated p-value (0.14) is larger than any 
reasonable significance level, we do not have convincing evidence that the original population is 
skewed to the right and conclude that it is plausible that the original sample came from a normal 
population. 


Part (d): 


maximum - median 





One possible statistic is — 
median - minimum 


If the distribution is right-skewed, one would expect the distance from the median to the maximum to 
be larger than the distance from the median to the minimum; thus the ratio should be greater than 1. 


Scoring 


Parts (a), (b), (c), and (d) are scored as essentially correct (E), partially correct (P), or incorrect (I). 
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Question 6 (continued) 
Part (a) is scored as follows: 
Essentially correct (E) if the student correctly states the hypotheses with a lower-tailed alternative 
hypothesis AND correctly defines the parameter of interest by referring to the mean of the population 
in context (e.g., population mean mpg, true mean fuel efficiency, mean mpg of all cars of this model, 
mean mpg of this car model). 


Partially correct (P) if only one component is correct (hypotheses or definition of parameter). 


Incorrect (I) otherwise. 


Notes: 

e If astudent attempts to define the parameter more than once (e.g., saying “the parameter is...” 
and then later saying uw = ...”), then these are treated as parallel solutions, and the worst attempt 
is scored. 


e Asymbol other than uw in the hypotheses must be explicitly and correctly defined to get credit for 
the parameter component. 
e If words are used in the hypotheses to describe the parameter, 
o the words do not count as the definition of the parameter, and 
o the words must be consistent with the definition of the parameter, 
or no credit is given for the parameter component. 


Part (b) is scored as follows: 


Essentially correct (EF) if the student states that large values (or values greater than 1) of the statistic 
indicate right-skewness AND justifies the answer with a correct statement of how right-skewness 
affects the relationship between the mean and median. 


Partially correct (P) if the student: 

e states that large values of the statistic indicate right-skewness BUT only argues that ina 
normal (or symmetric) distribution the ratio should be close to 1 (i.e., does not discuss the right- 
skewness) 

OR 

e makes a correct statement of how right-skewness affects the relationship between the mean 

and median BUT does not state that large values of the statistic indicate right-skewness 
OR 

e has reversed the relationship between mean and median in right-skewed distributions or 
reversed left- and right-skewness AND states that small values of the statistic indicate right- 
skewness. 


Incorrect (I) if the student says large values indicate right-skewness but gives no explanation or an 
incorrect explanation. 


Part (c) is scored as follows: 
Essentially correct (E) if the student states that it is plausible that the sample came from a normal 


population AND justifies the choice with specific numerical evidence from the dotplot describing the 
relative location of the value 1.03 (e.g., 14 percent of the values are above 1.03). 
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Question 6 (continued) 


Partially correct (P) if the student: 


e states that it is plausible that the sample came from a normal population AND justifies the 
choice by describing the relative location of the value 1.03 in the dotplot without specific 
numerical evidence (e.g., 1.03 is toward the middle of the distribution, 1.03 is within two 
standard deviations of the mean) 

OR 

e states that the sample came from a population that is skewed to the right AND justifies the 
choice by describing the relative location of the value 1.03 in the dotplot (e.g., only 14 percent 
of the values are above 1.03, 1.03 is in the tail of the distribution). 


Incorrect (I) if the student does not refer to the relative location of 1.03 in the dotplot. 


Note: 
Common incorrect responses include the following: 


Simply describing the shape of the dotplot to justify normality. 

Saying that the dots are centered around 1, so the sample came from a normal population. 
Arguing that 1.03 is close to 1 (without describing its relative position in the dotplot). 

Stating that the sample size (or number of samples) is large, so the distribution is normal, or that 
the sample size is too small to make a conclusion. 

Stating an answer with no explanation. 


Part (d) is scored as follows: 


Essentially correct (EF) if the student defines a reasonable statistic AND identifies the values that 
indicate right-skewness with a correct explanation of how the right-skewness affects the components 
of the statistic. 


Partially correct (P) if the student: 


e defines a reasonable statistic but fails to adequately justify the values that indicate right- 
skewness 
OR 
e does not define a reasonable statistic but uses values from the table (min, Q1, med, O03, max) to 
describe a reasonable method for identifying right-skewness (e.g., if the distance from the 
median to maximum is greater than the distance from the minimum to the median, the 
distribution is skewed to the right). 


Incorrect (I) if the method does not include a comparison (e.g., simply checking for outliers on one side 
only). 


Notes: 


The statistic must be a formula that produces a single numerical value. For example, (maximum - 
median) > (median - minimum) is not a statistic. 
Any statistic that uses summary values not in the table (such as the mean) is incorrect. 
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Question 6 (continued) 


a. _... Maximum-median ‘ 
Some other acceptable variations of the statistic ; a include the reciprocal (values 
median - minimim 


< 1 indicate right-skewness), reversing the order of subtraction in the numerator or denominator 
(values < -1 indicate right-skewness), and the difference of the numerator and denominator (values > 0 
indicate right-skewness). 





Some other acceptable statistics are listed below. For each statistic, values > 1 indicate right- 
skewness. Other variations, including reciprocals and statistics based on differences are also 


acceptable. 
QO1 + O3 
maximum - O03 3 2 
* Qt - minimum ’ median 





2 
’ median - O1 median 


minimum + maximum 
O3 - median 4 





Each essentially correct (E) response counts as 1 point, and each partially correct (P) response counts as 
¥% point. 


4 Complete Response 

3 Substantial Response 
2 Developing Response 
1 Minimal Response 


If a response is between two scores (for example, 22 points), use a holistic approach to determine whether 
to score up or down, depending on the strength of the response and communication. 
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SECTION II 
Part B 
Question 6 
Spend about 25 minutes on this part of the exam. 
Percent of Section I score—25 


Directions: Show all your work. Indicate clearly the methods you use, because you will be graded on the 
correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


6. A consumer organization was concerned that an automobile manufacturer was misleading customers by 
overstating the average fuel efficiency (measured in miles per gallon, or mpg) of a particular car model. The 
model was advertised to get 27 mpg. To investigate, researchers selected a random sample of 10 cars of that 
model. Each car was then randomly assigned a different driver. Each car was driven for 5,000 miles, and the 
total fuel consumption was used to compute mpg for that car. 


(a) Define the parameter of interest and state the null and alternative hypotheses the consumer organization is 
interested in testing. 


M* aes population mean mp4 for ths weslet 
Hoe MM; 27 
Ho: M<27 


GO ON TO THE NEXT PAGE. 
-14- 
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AL 


One condition for conducting a one-sample t-test in this situation is that the mpg measurements for the 
population of cars of this model should be normally distributed. However, the boxplot and histogram shown 
below indicate that the distribution of the 10 sample values is skewed to the right. 


ee 


Number of Cars 
= NY WwW & 


23 33 
Miles per Gallon 


_samplemenn.. . What values of that statistic 
sample median 


(small, large, close to one) might indicate that the population distribution of mpg values is skewed to the 
right? Explain. 


TF We otdtte 9 large, Yen pye popalehe 
dis tnbution of mp) Values io ShemeD te He 
rays Lue Hat pe semple R a Simple 


ee cee sole) Smee yee oe yan ry preoc€ 
Sens clive | bess Ce53t—-+ Lo extreme val-es 
fin Pe wedoan it well be hiner the 
We welinn wher He shar vy bo fre male, 


and bas fhe fap \Wen le be larg @, 


(b) One possible statistic that measures skewness is the ratio 


GO ON TO THE NEXT PAGE. 
-15- 
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AS (c) Even though the mpg values in the sample were skewed to the right, it is still possible that the population L A3 
distribution of mpg values is normally distributed and that the skewness was due to sampling variability. 
To investigate, 100 samples, each of size 10, were taken from a normal distribution with the same mean and 
sample mean 


standard deviation as the original sample. For each of those 100 samples, the statistic : 
sample median 


calculated. A dotplot of the 100 simulated statistics is shown below. 





e 
6 
0.94 0.95 0.96 0.97 098 099 100 1.01 1.02 1.03 1.04 1.05 1.06 


sample mean/sample median 


sample mean 
sample median 
the dotplot above, is it plausible that the original sample of 10 cars came from a normal population, or do the 
simulated results suggest the original population is really skewed to the right? Explain. 


The gue bel resets Segond tb tee Big, semeyle 
tame From a > bee po ulin, There are 14 pork 
ont pf joo tat hae 4 clebote Seo 
later fon pee wire bans flab a valve 
of [.0% of greet bjald weeer Green 

14% of dhe LHe by aac? Fe neces |, 


rahe bile é 
ro ee foss the. 10% 


dahritdel pope les _ Genera j 
3 nodes de (eject a nok yy other's Se 
War ghee Lathton dot» ao previhe —- ne euner (2 
+b oa tat oe cre plo siole (Lt as, eu ee 


Senge Ca wk on - no cme | popa|e tres: 


In the original sample, the value of the statistic was 1.03. Based on the value of 1.03 and 





GO ON TO THE NEXT PAGE. 
-16- 
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AY AY 


(d) The table below shows summary statistics for mpg measurements for the original sample of 10 cars. 


nines Or | Matin [08 [im 





What values of that statistic might indicate that the distribution is skewed to the right? Explain. 


Values HL substeaball, renter ion | wel 

vadizale a destelautor chew? fo fe nyt 
stare 6X would Mdeeote Tut flu Actin 
helveen Qa ak te mnedvn = weed 
‘aad Hi. We fatwe bhebween fe 

Ath ee Oe 4 wet Nes fr he roybt 

fail 8 longer Yan Weg \eft what wd tes 
Gk wness, 


GO ON TO THE NEXT PAGE. 
-17- 
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B\ STATISTICS 
SECTION I a BI 
Part B 
_ Question 6 
Spend about 25 minutes on this part of the exam. 
Percent of Section I score—25 


Directions: Show all your work. Indicate clearly the methods you use, because you will be graded on the 
correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


6. A consumer organization was concerned that an automobile manufacturer was misleading customers by 
overstating the average fuel efficiency (measured in miles per gallon, or mpg) of a particular car model. The 
model was advertised to get 27 mpg. To investigate, researchers selected a random sample of 10 cars of that 


model. Each car was then randomly assigned a different driver. Each car was driven for 5, 000 miles, and the 
total fuel consumption was used to compute mpg for that car. 


(a) Define the parameter of interest and state the null and alternative hypotheses the consumer organization is 
interested in testing. , 


pA the Men bone! ef iteenuy 


In niet pe, Bellan 
the pect itu, Cor 


Ho wert 
aw, Zt 


ot oll (“ro of 


model. 


GO ON TO THE NEXT PAGE. 
-14- 


© 2009 The College Board. All rights reserved. 
Visit the College Board on the Web: www.collegeboard.com. 


R2 One condition for conducting a one-sample t-test in this situation is that the mpg measurements for the 
population of cars of this model should be normally distributed. However, the boxplot and histogram shown 4 R2 
below indicate that the distribution of the 10 sample values is skewed to the right. 


-}— 


Number of Cars 
mm hw WwW ff 


23 33 
Miles per Gallon 


Sample mean What values of that statistic 
sample median 


(small, large, close to one) might indicate that the population distribution of mpg values is skewed to the 
right? Explain. 


A loge Pe. WA ald dicebe S be uness 
He f (6 ht be CanGe it diy re 19 


(b) One possible statistic that measures skewness is the ratio 


Ww 
4 ae ty Hr wit, bt ; +loen Pine Semple 


cen ronld [pe Polld Noh Hen the 
jample ein, meking Hee vabiv lopg ee 
thera (. 


GO ON TO THE NEXT PAGE. 
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B (c) Even though the mpg values in the sample were skewed to the right, it is still possible that the population 
% distribution of mpg values is normally distributed and that the skewness was due to sampling variability. 
To investigate, 100 samples, each of size 10, were taken from a normal distribution with the same mean an 


standard deviation as the original sample. For each of those 100 samples, the statistic Phanatic 
sample median 
calculated. A dotplot of the 100 simulated statistics is shown below. 
$ : 
0 38 $88 23 Qs . 
cocee cole Stqsssosstsse SeshessseSe S8S0,3  ofs 


@ 
0.94 0.95 0.96 097 0.98 0.99 1.00 101 1.02 1.03 104° 1.05 1.06 
sample mean/sample median 


In the original sample, the value of the statistic sapien was 1.03. Based on the value of 1.03 and 
sample median 


the dotplot above, is it plausible that the original sample of 10 cars came from a normal population, or do th 
simulated results suggest the original population is really skewed to the right? Explain. 


Tt 7 lowsible that he ary sol genple 


( an @ Ce. 2. = pee [7 opr (atrOn 


bereuse thy distr hatin 1% certerd ot | 

WHE To NO Shem, mad Phe diotribatin 
h moved ghee and HCG NS Godel, amines, Fe 

fF 7% eqeally (eedy fe get skecness 

i the lott 4 to the right. 


GO ON TO THE NEXT PAGE. 
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(d) The table below shows summary statistics for mpg measurements for the original sample of 10 cars. 6 B Y 


m Cini HO Pei 03 | Maxi | 


Choosing only from the summary statistics in the table, define a formula for a different statistic that 


measures skewness. 
Q, = medina 


medion Q, 





What values of that statistic might indicate that the distribution is skewed to the right? Explain. 


Lune tows val). adiede- sbeusness 

& ee yyot becanse Ff there 
me skew tHe right, Q, end the 
meditn wil be fete a pret Fhe 


the mediin end Q, making thee 
G Cate then 


Vb ba 


GO ON TO THE NEXT PAGE. 
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SECTION II 
-Part-B- 
Question 6 
Spend about 25 minutes on this part of the exam. 
Percent of Section II score-—-25 


Directions: Show all your work. Indicate clearly the methods you use, because you will be graded on the 
correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


6. A consumer organization was concerned that an automobile manufacturer was misleading customers by 
overstating the average fuel efficiency (measured in miles per gallon, or mpg) of a particular car model. The 
model was advertised to get 27 mpg. To investigate, researchers selected a random sample of 10 cars of that 


model. Each car was then randomly assigned a different driver. Each car was driven for 5,000 miles, and the 
total fuel consumption was used to compute mpg for that car. 


(a) Define the parameter of interest and state the null and alternative hypotheses the consumer organization is 
interested in testing. 


| A whe & WW wr 
= Aw hu m 
Pret Of WiMar ——-) Ww oe yi; 


sil aaa 
Wy: kw 421 bas 


GO ON TO THE NEXT PAGE. 
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One condition for erm a een ag t-test in this situation is that the mpg measurements the 6 


popula ion-6 2no ' 
below indicate tha the dict bution of zt 10 nie values i is Sayed to the right. 


HL] 





Number of Cars 
= NHN WwW 


23 33 
Miles per Gallon 


_Sample mean_ . What values of that statistic 
sample median 


(small, large, close to one) might indicate that the population distribution of mpg values is skewed to the 
right? Explain. 


(b) One possible statistic that measures skewness is the ratio 


tS a yor Wate, NS SHOWA A We wg Ahe mean 


we ON cab Xn XW wed am, oo) Wh 


\ige, SMe wns oe 


Saag\e waetiann 


GO ON TO THE NEXT PAGE. 
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C3 (c) Even though the mpg values in the sample were skewed to the right, it is still possible that the population b Gq 3 
distribution of mpg values is normally distributed and that the skewness was due to sampling variability. 
To investigate, 100 samples, each of size 10, were taken from a normal distribution with the same mean and 


standard deviation as the original sample. For each of those 100 samples, the statistic ——- was 
sample median 


calculated. A dotplot of the 100 simulated statistics is shown below. 


2 
33 

Sense ese HH 33 @ $3 333. H ese 

0.94 0.95 0.96 097 098 0.99 1.00 1.01 1.02 1.03 104 1.05 1.06 


sample mean/sample median 


sample mean 
sample median 
the dotplot above, is it plausible that the original sample of 10 cars came from a normal population, or do the 
simulated results suggest the original population is really skewed to the right? Explain. 


In the original sample, the value of the statistic was 1.03. Based on the value of 1.03 and 


Ww Aga SUR he ong wah peMlaran cond naw 
Sving_ wean 

Yeoe yrorwal. 3b vernal, Aue 
Sumyat wealth 

We Y WA ee awk yok Weve wy tay wr be. 
contntd MAN WAX on iy Worm shyt. 


Who Should 


GO ON TO THE NEXT PAGE. 
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CH (d) The table below shows summary statistics for mpg measurements for the original sample of 10 cars. C CY 


nas | OF [ Median | G3 | Maxi | 
spo] as [ep | 


Choosing only from the summary statistics in the table, define a formula for a different statistic that 
measures skewness. 


\F 
\ median - www wn 4 \ vweAlan - waren | 






Saw sy WG Wega 


What values of that statistic might indicate that the distribution is skewed to the right? Explain. 


\ 


\medin — itn | \ wed ae — mw vn 


wen ye iti 1c Weel yo ad 0" He 
Vest Wedteh Wy weve UW +o he Iefte 


Gidv fF Aho Aye weve por polity 
wt wnuelagaecd 





yt 3 US »1¥ A le 
Xe of woliry a ec 


\2-,\ 4 \3 - tel, 


2127 a ve GO ON TO THE NEXT PAGE. 
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Question 6 
Overview 


The primary goals of this investigative task were to assess a student’s ability to (1) define a parameter and 
state a correct pair of hypotheses; (2) explain how a particular statistic measures skewness; (3) use the 
observed value of the statistic and a simulated sampling distribution to make a conclusion about the shape 
of the population; and (4) create a new statistic and explain how it measures skewness. 


Sample: 6A 
Score: 4 


In part (a) the student correctly defines the parameter of interest, including the concepts of mean and 
population in context, and correctly states the null and alternative hypotheses using standard notation. 

Part (a) was scored as essentially correct. In part (b) the student correctly states that large values of the 
statistic indicate that the distribution is skewed to the right and gives a justification that describes the 
relationship between the mean and median when the distribution is skewed right. Part (b) was scored as 
essentially correct. In part (c) the student correctly states that there is not convincing evidence that the 
original sample came from a skewed population and gives specific numerical evidence from the dotplot 

(‘14 points out of 100”). Although not necessary, the student goes on to correctly explain when there would 
be convincing evidence that the distribution was skewed right. Part (c) was scored as essentially correct. In 
part (d) the student provides a reasonable statistic to measure skewness. The student then correctly 
identifies the values of the statistic that indicate right-skewness and justifies the response by discussing how 
the components of the statistic are affected by right-skewness. Part (d) was scored as essentially correct. The 
entire answer, based on all four parts, was judged a complete response and earned a score of 

4 points. 


Sample: 6B 
Score: 3 


In part (a) the student correctly defines the parameter of interest, including the concepts of mean and 
population in context, and correctly states the null and alternative hypotheses using standard notation. 

Part (a) was scored as essentially correct. In part (b) the student correctly states that large values of the 
statistic indicate that the distribution is skewed to the right and gives a justification that describes the 
relationship between the mean and median when the distribution is skewed right. Part (b) was scored as 
essentially correct. In part (c) the student correctly says, “It is plausible that the original sample came from a 
normal population.” However, the justification provided is not based on the relative location of the value 1.03 
in the dotplot. Instead, the student simply describes the shape and center of the dotplot and concludes that it 
is equally likely to get skewness to the left or to the right when sampling from a normal population. This is a 
true statement, but it does not address the question about the shape of the original population. Part (c) was 
scored as incorrect. In part (d) the student provides a reasonable statistic to measure skewness. The student 
then correctly identifies the values of the statistic that indicate right-skewness and justifies the response by 
discussing how the components of the statistic are affected by right-skewness. Part (d) was scored as 
essentially correct. With three parts essentially correct and one part incorrect, the entire answer was judged a 
substantial response and earned a score of 3 points. 
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Question 6 (continued) 


Sample: 6C 
Score: 2 


In part (a) the student correctly defines the parameter of interest but incorrectly uses a two-sided alternative 
hypothesis. Part (a) was scored as partially correct. In part (b) the student correctly states the relationship 
between the mean and median when the distribution is skewed to the right and then concludes that large 
values of the statistic indicate that the distribution is skewed to the right. Part (b) was scored as essentially 
correct. In part (c) the student does not use the relative location of 1.03 to make a decision about the shape of 
the original population. The student simply states that this distribution is centered at 1, which is expected 
when sampling from a normal distribution. Although this is a true statement, it does not give any information 
about the shape of the original population. Part (c) was scored as incorrect. In part (d), the student does not 
provide a formula for a single statistic that measures skewness. Instead, the student provides an inequality. 
However, the student then correctly explains how the inequality could be used to evaluate (right) skewness. 
Part (d) was scored as partially correct. With one part essentially correct, two parts partially correct, and one 
part incorrect, the entire answer was judged a developing response and earned a score of 2 points. 
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